An erasure channel with a fixed alphabet size $q$, where $q \gg 1$, is studied. It is proved that over any erasure channel (with or without memory), Maximum Distance Separable (MDS) codes achieve the minimum probability of error (assuming maximum likelihood decoding). Assuming a memoryless erasure channel, the error exponents of MDS codes are compared with those of random codes and linear random codes. It is shown that the envelopes of all these exponents are identical for rates above the critical rate. Noting the optimality of MDS codes, it is concluded that both random codes and linear random codes are exponentially optimal, whether the block size is larger or smaller than the alphabet size.

I. Introduction

Erasure channels with large alphabet sizes have recently received significant attention in networking applications. Different erasure channel models are adopted to study the performance of end-to-end connections over the Internet [1], [2]. In such models, each packet is seen as a $q = 2^b$-ary symbol, where $b$ is the packet length in bits. In this work, a memoryless erasure channel with a fixed, but large alphabet size is considered. The error probabilities over this channel (assuming maximum-likelihood decoding) for Maximum Distance Separable (MDS) and random codebooks are compared and shown to be exponentially identical for rates above the critical rate.

Shannon [3] was the first to observe that the error probability for maximum likelihood decoding of a random code ($P^{rand}_{E,ML}$) can be upper-bounded by a function that decays exponentially with the code block length $N$. This exponent is positive as long as the rate stays below the channel capacity, $R < C$. Following this result, tighter bounds were proposed in the literature [4]-[6]. For rates below the critical rate, modifications of random coding are proposed to achieve tighter bounds [7]. Interestingly, the exponential upper-bound on $P^{rand}_{E,ML}$ remains valid regardless of the alphabet size $q$, even in the case where $q$ is larger than the block size $N$ (e.g., see the steps of the proofs in [6]). There is also a lower-bound on the probability of error using random coding, which is known as the sphere packing bound [8]. For channels with a relatively small alphabet size ($q \ll N$), both the sphere packing lower-bound and the random coding upper-bound on the error probability are exponentially tight for rates above the critical rate [9]. However, the sphere packing bound is not tight if the alphabet size, $q$, is comparable to the coding block length $N$ (noting the terms $o_1(N)$ and $o_2(N)$ in [8]).

Probability of error, minimum distance, and distance distribution of random linear codes are discussed 
in [10], [11]. Pierce studies the asymptotic behavior of the minimum distance of binary random linear 
codes [10]. Error exponent of random linear codes over a binary symmetric channel is analyzed in [11]. 
Barg et al. also study the minimum distance and distance distribution of random linear codes and show 
that random linear codes have better expurgated error exponent as compared to random codes for rates 
below the critical rate [11]. 

Maximum Distance Separable (MDS) [12] codes are optimum in the sense that they achieve the largest possible minimum distance, $d_{min}$, among all block codes of the same size. Indeed, any codeword in an MDS code of size $[N, K]$ can be successfully decoded from any subset of its coded symbols of size $K$ or more. This property makes MDS codes suitable for use over erasure channels like the Internet [1], [2], [13]. However, the practical encoding-decoding algorithms for such codes have quadratic time complexity in terms of the code block length [14]. Theoretically, more efficient ($O(N \log^2 N)$) MDS codes can be constructed based on evaluating and interpolating polynomials over specially chosen finite fields using the Discrete Fourier Transform [15]. However, in practice these methods cannot compete with the quadratic methods except for extremely large block sizes. Recently, a family of almost-MDS codes with low encoding-decoding complexity (linear in length) has been proposed and shown to provide a practical alternative for coding over erasure channels like the Internet [16]. In these codes, any subset of symbols of size $K(1 + \epsilon)$ is sufficient to recover the original $K$ symbols with high probability [16]. Fountain codes, based on the idea of almost-MDS codes with linear decoding complexity, are proposed for information multicasting to many users over an erasure channel [17], [18].
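The "any $K$ out of $N$ symbols" property of MDS codes can be illustrated with a toy Reed-Solomon-style construction. The sketch below is not taken from the cited works: it assumes a prime field size (so that arithmetic is plain modular arithmetic), and the names `P`, `encode`, `decode`, and `poly_mul_linear` are hypothetical. It uses the quadratic-time Lagrange interpolation approach rather than any of the faster transform-based methods.

```python
# Toy Reed-Solomon-style MDS code over the prime field GF(P).
# Assumption: P is prime, so inverses exist via pow(x, -1, P) (Python 3.8+).
P = 257

def encode(message, n):
    """Evaluate the message polynomial at the points 0, 1, ..., n-1 (n <= P)."""
    return [sum(m * pow(x, j, P) for j, m in enumerate(message)) % P
            for x in range(n)]

def poly_mul_linear(poly, root):
    """Multiply poly(x) (low-order coefficient first) by (x - root) mod P."""
    out = [0] * (len(poly) + 1)
    for j, c in enumerate(poly):
        out[j] = (out[j] - root * c) % P
        out[j + 1] = (out[j + 1] + c) % P
    return out

def decode(received, k):
    """Recover the k message symbols from any k unerased symbols.

    `received` maps position -> value for the symbols that were not erased.
    Lagrange interpolation rebuilds the degree-(k-1) message polynomial.
    """
    points = sorted(received.items())[:k]
    if len(points) < k:
        raise ValueError("fewer than K symbols received")
    message = [0] * k
    for xi, yi in points:
        basis, denom = [1], 1
        for xj, _ in points:
            if xj != xi:
                basis = poly_mul_linear(basis, xj)
                denom = denom * (xi - xj) % P
        scale = yi * pow(denom, -1, P) % P
        for j in range(k):
            message[j] = (message[j] + scale * basis[j]) % P
    return message
```

For instance, with `cw = encode([5, 17, 200], 7)`, calling `decode({i: cw[i] for i in (1, 4, 6)}, 3)` uses only three of the seven symbols and returns the original message.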

In this work, a memoryless erasure channel with a fixed, but large alphabet size is studied. First, it is 
proved that MDS block codes offer the minimum probability of decoding error over any erasure channel. 
Then, error exponents of MDS codes, random codes, and linear random codes for a memoryless erasure 
channel are analyzed and shown to be identical for rates above the critical rate. Combining the two results, 
we conclude that both random codes and linear random codes are exponentially as good as MDS codes 
(exponentially optimal) over a wide range of rates. 



Fig. 1. Erasure memoryless channel model with the alphabet size $q$, probability of erasure $\pi$, and the erasure symbol $\xi$.

The rest of this paper is organized as follows. In section II, the erasure channel model is introduced, and the assumption of large alphabet sizes is justified. Section III proves that MDS codes are optimum over any erasure channel. Error exponents of MDS codes, random codes, and linear random codes over a memoryless erasure channel are compared in section IV. Finally, section V concludes the paper.

II. Erasure Channel Model 

The memoryless erasure channel studied in this work has the alphabet size $q$ and the erasure probability $\pi$ (see Fig. 1). The alphabet size $q$ is assumed to be fixed and large, i.e., $q \gg 1$.

The described channel model occurs in many practical scenarios such as the Internet. From an end-to-end protocol's perspective, performance of the lower layers in the protocol stack can be modeled as a random channel called an Internet channel. Since each packet usually includes an internal error detection mechanism (for instance a Cyclic Redundancy Check), the Internet channel can be modeled as an erasure channel with packets as symbols [19]. If each packet contains $b$ bits, the corresponding channel will have an alphabet size of $q = 2^b$, which is huge for typical packet sizes. Therefore, in practical networking applications, the block size is usually much smaller than the alphabet size. Algebraic computations over Galois fields $\mathbb{F}_q$ of such large cardinalities are now practically feasible with the increasing processing power of electronic circuits. Note that network coding schemes, recently proposed and applied for content distribution over large networks, have a comparable computational complexity [20]-[26].

Note that all the known MDS codes have alphabets of a large size (growing at least linearly with the block length $N$). Indeed, a conjecture on MDS codes states that for every linear $[N, K]$ MDS code over the Galois field $\mathbb{F}_q$, if $1 < K < q$, then $N \le q + 1$, except when $q$ is even and $K = 3$ or $K = q - 1$, for which $N \le q + 2$ [27]. To have a feasible MDS code over a channel with the alphabet size $q$, the block size $N$ should satisfy $N \le q + 1$.
III. Optimality of MDS Codes over Erasure Channels



Maximum Distance Separable (MDS) codes are optimum in the sense of achieving the largest possible minimum distance, $d_{min}$, among all block codes of the same size [12]. The following proposition shows that MDS codes are also optimum over any erasure channel in the sense of achieving the minimum probability of decoding error.

Definition I. An erasure channel is defined as one which maps every input symbol either to itself or to an erasure symbol $\xi$. More precisely, an arbitrary channel (memoryless or with memory) with the input vector $\mathbf{x} \in \mathcal{X}^N$, $|\mathcal{X}| = q$, the output vector $\mathbf{y} \in (\mathcal{X} \cup \{\xi\})^N$, and the transition probability $p(\mathbf{y}|\mathbf{x})$ is defined to be an erasure channel iff it satisfies the following conditions:

1) $p(y_j \notin \{x_j, \xi\} \mid x_j) = 0$, $\forall j$, where $x_j$, $y_j$, and $e_j$ denote the $j$'th elements of the vectors $\mathbf{x}$, $\mathbf{y}$, and $\mathbf{e}$.

2) Defining the erasure identifier vector $\mathbf{e}$ as

$$e_j = \begin{cases} 1 & \text{if } y_j = \xi \\ 0 & \text{otherwise,} \end{cases}$$

$p(\mathbf{e}|\mathbf{x})$ is independent of $\mathbf{x}$.

Proposition I. A block code of size $[N, K]$ with equiprobable codewords over an arbitrary erasure channel (memoryless or with memory) has the minimum probability of error (assuming optimum, i.e., maximum likelihood decoding) among all block codes of the same size if that code is Maximum Distance Separable (MDS).

Proof. Consider an $[N, K, d]$ codebook $\mathcal{C}$ with $q$-ary codewords of length $N$, number of codewords $q^K$, and minimum distance $d$. The distance between two codewords is defined as the number of positions in which the corresponding symbols are different (Hamming distance). A codeword $\mathbf{x} \in \mathcal{C}$ is transmitted and a vector $\mathbf{y} \in (\mathcal{X} \cup \{\xi\})^N$ is received. The number of erased symbols is equal to the Hamming weight of $\mathbf{e}$, denoted by $w(\mathbf{e})$. An error occurs if the decoder decides for a codeword different from $\mathbf{x}$. Let us assume that the probability of having a specific erasure pattern $\mathbf{e}$ is $P\{\mathbf{e}\}$, which is independent of the transmitted codeword (it depends only on the channel). We assume a specific erasure vector $\mathbf{e}$ of weight $m$. The decoder decodes the transmitted codeword based on the $N - m$ correctly received symbols. We partition the codebook, $\mathcal{C}$, into $q^{N-m}$ bins, each bin representing a specific received vector satisfying the erasure pattern $\mathbf{e}$. The number of codewords in the $i$'th bin is denoted by $b_e(i)$ for $i = 1, \dots, q^{N-m}$. Knowing the erasure vector $\mathbf{e}$ and the received vector $\mathbf{y}$, the decoder selects the bin $i$ corresponding to $\mathbf{y}$. The set of possible transmitted codewords is equal to the set of codewords in bin $i$ (all the codewords in bin $i$ are equiprobable to be transmitted). If $b_e(i) = 1$, the transmitted codeword $\mathbf{x}$ can be decoded with no ambiguity. Otherwise, the optimum decoder randomly selects one of the $b_e(i) > 1$ codewords in the bin. Thus, the probability of error is $1 - \frac{1}{b_e(i)}$ when bin $i$ is selected. Bin $i$ is selected if one of the codewords it contains is transmitted. Hence, the probability of selecting bin $i$ is equal to $\frac{b_e(i)}{q^K}$. Based on the above arguments, the probability of decoding error for the maximum likelihood decoder of any codebook, $\mathcal{C}$, is equal to

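$$
\begin{aligned}
P^{\mathcal{C}}_{E,ML} &\overset{(a)}{=} \sum_{m=d}^{N} \sum_{\mathbf{e}: w(\mathbf{e})=m} P\{\mathbf{e}\}\, P\{\text{error} \mid \mathbf{e}\} \\
&= \sum_{m=d}^{N} \sum_{\mathbf{e}: w(\mathbf{e})=m} P\{\mathbf{e}\} \sum_{i=1,\ b_e(i)>0}^{q^{N-m}} \frac{b_e(i)}{q^K}\left(1 - \frac{1}{b_e(i)}\right) \\
&\overset{(b)}{=} \sum_{m=d}^{N} \sum_{\mathbf{e}: w(\mathbf{e})=m} P\{\mathbf{e}\}\left(1 - \frac{b_e^+}{q^K}\right) \\
&\overset{(c)}{\ge} \sum_{m=d}^{N} \sum_{\mathbf{e}: w(\mathbf{e})=m} P\{\mathbf{e}\}\left(1 - \frac{\min\{q^{N-m}, q^K\}}{q^K}\right)
\end{aligned} \tag{1}
$$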

where $b_e^+$ indicates the number of bins containing one or more codewords. (a) follows from the fact that the transmitted codeword can be uniquely decoded if the number of erasures in the channel is less than the minimum distance of the codebook, and (b) follows from the fact that $\sum_{i=1}^{q^{N-m}} b_e(i) = q^K$. (c) is true since $b_e^+$ can exceed neither the total number of codewords nor the number of bins.

According to (1), $P^{\mathcal{C}}_{E,ML}$ is minimized for a codebook $\mathcal{C}$ if two conditions are satisfied. First, the minimum distance of $\mathcal{C}$ should achieve the maximum possible value, i.e., $d = N - K + 1$. Second, we should have $b_e^+ = q^{N-m}$ for all possible erasure vectors $\mathbf{e}$ with any weight $d \le m \le N$. Any MDS code satisfies the first condition by definition. Moreover, it is easy to show that for any MDS code, we have $b_e(i) = q^{K-N+m}$. We first prove this for the case of $m = N - K$. Consider the bins of an MDS code for any arbitrary erasure pattern $\mathbf{e}$, $w(\mathbf{e}) = N - K$. From the fact that $d = N - K + 1$ and $\sum_{i=1}^{q^K} b_e(i) = q^K$, it is concluded that each bin contains exactly one codeword. Therefore, there exists only one codeword which matches any $K$ correctly received symbols. Now, consider any general erasure pattern $\mathbf{e}$, $w(\mathbf{e}) = m > N - K$. For the $i$'th bin, concatenating any $K - N + m$ arbitrary symbols to the $N - m$ correctly received symbols results in a distinct codeword of the MDS codebook. Having $q^{K-N+m}$ possibilities to expand the received $N - m$ symbols to $K$ symbols, we have $b_e(i) = q^{K-N+m}$. This completes the proof. ■
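As a small numerical illustration of Proposition I, the binning argument can be evaluated exhaustively for toy codebooks. The sketch below assumes a memoryless erasure channel and uses two hypothetical $[3, 2]$ codebooks over $\mathbb{F}_3$ chosen only for illustration; the function name `ml_error_probability` is not from the paper. For every erasure pattern it counts the non-empty bins $b_e^+$ and accumulates $P\{\mathbf{e}\}(1 - b_e^+/q^K)$, as in step (b) of (1).

```python
from itertools import product

def ml_error_probability(codebook, pi):
    """Exact ML error probability over a memoryless erasure channel.

    For each erasure pattern e, codewords are binned by their unerased
    symbols; with b_plus non-empty bins the conditional error probability
    is 1 - b_plus / (number of codewords).
    """
    n = len(codebook[0])
    total = 0.0
    for e in product((0, 1), repeat=n):      # 1 marks an erased position
        m = sum(e)
        prob_e = pi ** m * (1 - pi) ** (n - m)
        bins = {tuple(c[j] for j in range(n) if not e[j]) for c in codebook}
        total += prob_e * (1 - len(bins) / len(codebook))
    return total

q = 3
# [3, 2] single-parity-check code over GF(3): d = 2 = N - K + 1, hence MDS.
mds = [(a, b, (a + b) % q) for a in range(q) for b in range(q)]
# [3, 2] code repeating the first symbol: d = 1, not MDS.
non_mds = [(a, b, a) for a in range(q) for b in range(q)]
```

Evaluating `ml_error_probability(mds, 0.1)` and `ml_error_probability(non_mds, 0.1)` shows the MDS codebook with the strictly smaller error probability, as the proposition predicts for this channel.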

Remark I. Proposition I is valid for any $N$ and $1 \le K \le N$. However, it does not guarantee the existence of an $[N, K]$ MDS code for all such values of $N$ and $K$. In fact, as stated in section II, a conjecture on MDS codes states that for every linear $[N, K]$ MDS code over the Galois field $\mathbb{F}_q$, we have $N \le q + 1$ in most cases. Moreover, based on the Singleton bound, the inequality in (1) can be written as

$$P^{\mathcal{C}}_{E,ML} \ge \sum_{m=N-K+1}^{N} \sum_{\mathbf{e}: w(\mathbf{e})=m} P\{\mathbf{e}\}\left(1 - \frac{q^{N-m}}{q^K}\right). \tag{2}$$

Interestingly, this lower-bound is valid for any codebook $\mathcal{C}$ of size $[N, K]$, whether an MDS code of that size exists or not.

Corollary I. For $N \le q + 1$, the converse of Proposition I is also true if the following condition is satisfied

$$\forall \mathbf{e} \in \{0, 1\}^N : P\{\mathbf{e}\} > 0. \tag{3}$$

Proof. For $N \le q + 1$ and $1 \le K \le N$, we know that an MDS code of size $[N, K]$ does exist (an $[N, K]$ Reed-Solomon code can be constructed over $\mathbb{F}_q$, see [28]). Let us assume the converse of Proposition I is not true. Then, there should be a non-MDS codebook, $\mathcal{C}$, with the size $[N, K, d]$, $d < N - K + 1$, which achieves the minimum probability of error ($P^{\mathcal{C}}_{E,ML} = P^{MDS}_{E,ML}$). For any erasure vector $\mathbf{e}'$ with the weight $w(\mathbf{e}') = N - K$, we can write



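$$
\begin{aligned}
P\{\mathbf{e}'\}\left(1 - \frac{b_{e'}^+}{q^K}\right) &\overset{(a)}{\le} \sum_{\mathbf{e}: w(\mathbf{e})=N-K} P\{\mathbf{e}\}\left(1 - \frac{b_e^+}{q^K}\right) \\
&\overset{(b)}{\le} \sum_{m=d}^{N-K} \sum_{\mathbf{e}: w(\mathbf{e})=m} P\{\mathbf{e}\}\left(1 - \frac{b_e^+}{q^K}\right) \\
&\overset{(c)}{\le} \sum_{m=d}^{N} \sum_{\mathbf{e}: w(\mathbf{e})=m} P\{\mathbf{e}\}\left(1 - \frac{b_e^+}{q^K}\right) - \sum_{m=N-K+1}^{N} \sum_{\mathbf{e}: w(\mathbf{e})=m} P\{\mathbf{e}\}\left(1 - \frac{q^{N-m}}{q^K}\right) \\
&\overset{(d)}{=} P^{\mathcal{C}}_{E,ML} - P^{MDS}_{E,ML} \overset{(e)}{=} 0
\end{aligned} \tag{4}
$$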

where (a), (b), and (c) follow from the fact that $b_e^+ \le \min\{q^{N-m}, q^K\}$ if $w(\mathbf{e}) = m$. (d) and (e) are based on (1) and the assumption that $P^{\mathcal{C}}_{E,ML} = P^{MDS}_{E,ML}$. Combining (3) and (4) results in $b_{e'}^+ = q^K$. Thus, we have $b_{e'}(i) = 1$ for all $1 \le i \le q^K$ and any $\mathbf{e}'$ with the weight of $w(\mathbf{e}') = N - K$.

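On the other hand, we know that the minimum distance of $\mathcal{C}$ is $d$. Thus, there exist two codewords $\mathbf{c}_1$ and $\mathbf{c}_2$ in $\mathcal{C}$ with the distance of $d$ from each other. We define the vector $\mathbf{e}_{12}$ element-wise as follows

$$e_{12,j} = \begin{cases} 0 & \text{if } c_{1,j} = c_{2,j} \\ 1 & \text{otherwise.} \end{cases} \tag{5}$$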

It is obvious that $w(\mathbf{e}_{12}) = d \le N - K$. Then, we construct the binary vector $\mathbf{e}^*$ by replacing a sufficient number of zeros in $\mathbf{e}_{12}$ with ones such that $w(\mathbf{e}^*) = N - K$. The positions of these replacements can be arbitrary. In the binning corresponding to the erasure vector $\mathbf{e}^*$, both $\mathbf{c}_1$ and $\mathbf{c}_2$ would be in the same bin since they agree in all $K$ unerased positions. However, we know that $b_{e^*}(i) = 1$ for all $1 \le i \le q^K$ since $w(\mathbf{e}^*) = N - K$. This contradiction proves the corollary. ■

The memoryless erasure channel obviously satisfies the condition in (3). Combining Proposition I and Corollary I results in Corollary II.

Corollary II. A block code of size [N, K] with equiprobable codewords over a memoryless erasure 
channel has the minimum probability of error (assuming optimum, i.e., maximum likelihood decoding) 
among all block codes of the same size iff that code is Maximum Distance Separable (MDS). 

A. MDS codes with Suboptimal Decoding 

In the proof of Proposition I, it is assumed that the received codewords are decoded based on maximum likelihood decoding, which is optimum in this case. However, in many practical cases, MDS codes are decoded by simpler decoders [28]. Such suboptimal decoders can perfectly reconstruct the codewords of an $[N, K]$ codebook if they receive $K$ or more symbols correctly. In case more than $N - K$ symbols are erased, a decoding error occurs. Let $P^{MDS}_{E,sub}$ denote the probability of this event. $P^{MDS}_{E,sub}$ is obviously different from the decoding error probability of the maximum likelihood decoder, denoted by $P^{MDS}_{E,ML}$. Theoretically, an optimum maximum likelihood decoder of an MDS code may still decode the original codeword correctly with a positive, but small probability, if it receives fewer than $K$ symbols. More precisely, according to the proof of Proposition I, such a decoder is able to correctly decode an MDS code over $\mathbb{F}_q$ with the probability of $\frac{1}{q^i}$ after receiving $K - i$ correct symbols. Of course, for Galois fields with large cardinality, this probability is usually negligible. The relationship between $P^{MDS}_{E,sub}$ and $P^{MDS}_{E,ML}$ can be summarized as



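follows

$$
\begin{aligned}
P^{MDS}_{E,ML} &= P^{MDS}_{E,sub} - \sum_{i=1}^{K} \frac{1}{q^i}\, P\{K - i \text{ symbols received correctly}\} \\
&\ge P^{MDS}_{E,sub} - \frac{1}{q} \sum_{i=1}^{K} P\{K - i \text{ symbols received correctly}\} \\
&= \left(1 - \frac{1}{q}\right) P^{MDS}_{E,sub}.
\end{aligned} \tag{6}
$$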



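Hence, $P^{MDS}_{E,ML}$ is bounded as

$$\left(1 - \frac{1}{q}\right) P^{MDS}_{E,sub} \le P^{MDS}_{E,ML} \le P^{MDS}_{E,sub}. \tag{7}$$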
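The relationship in (6) and (7) can be checked numerically for a memoryless erasure channel. The following sketch assumes i.i.d. erasures with probability `pi`, so that the number of correctly received symbols is binomial; the function name and parameter values are illustrative only.

```python
from math import comb

def mds_error_probabilities(n, k, q, pi):
    """Return (P_sub, P_ml) for an [n, k] MDS code over a memoryless
    erasure channel with erasure probability pi."""
    def p_received(i):
        # Probability that exactly i of the n symbols are received correctly.
        return comb(n, i) * (1 - pi) ** i * pi ** (n - i)

    # Suboptimal decoder: fails whenever fewer than k symbols are received.
    p_sub = sum(p_received(k - i) for i in range(1, k + 1))
    # ML decoder: with k - i symbols received it still guesses correctly
    # with probability q**(-i), as in the first line of (6).
    p_ml = sum((1 - q ** (-i)) * p_received(k - i) for i in range(1, k + 1))
    return p_sub, p_ml
```

For example, `mds_error_probabilities(10, 6, 16, 0.2)` returns a pair that satisfies $(1 - 1/q) P_{sub} \le P_{ML} \le P_{sub}$, and the gap between the two shrinks as `q` grows.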
IV. Error Exponents of MDS, Random, and Linear Random Codes

A. Error Exponent of MDS Codes over a Memoryless Erasure Channel 

Consider a block code of size $[N, K]$ over the memoryless erasure channel of Fig. 1. Let $\alpha = \frac{N-K}{N}$ denote the coding overhead. For a $q$-ary $[N, K]$ code, the rate per symbol, $R$, is equal to

$$R = \frac{K}{N} \log q = (1 - \alpha) \log q. \tag{8}$$

In a block code of length $N$, the number of lost symbols would be $\sum_{i=1}^{N} e_i$, where $e_i$ is defined in Definition I. Thus, the probability of decoding error for the suboptimal decoder of subsection III-A can be written as



$$P^{MDS}_{E,sub} = P\left\{ \frac{1}{N} \sum_{i=1}^{N} e_i > \alpha \right\} = \sum_{i=0}^{K-1} P_i \tag{9}$$

where $P_i$ denotes the probability that $i$ symbols are received correctly. Since the $e_i$'s are i.i.d. random variables with Bernoulli distribution, we have $P_i = \binom{N}{i} (1 - \pi)^i \pi^{N-i}$. It is easy to see that

$$\frac{P_i}{P_{i-1}} = \frac{(N - i + 1)(1 - \pi)}{i \pi} > 1 \quad \text{for } i = 1, \dots, K - 1 \tag{10}$$

if $\alpha = \frac{N-K}{N} > \pi$. According to equation (8), the condition $\alpha > \pi$ can be rewritten as $R < (1 - \pi) \log q = C$, where $C$ is the capacity of the memoryless erasure channel. Therefore, the summation terms in equation (9) are always increasing, and the largest term is the last one. Now, we can bound $P^{MDS}_{E,sub}$ as $P_{K-1} \le P^{MDS}_{E,sub} \le K P_{K-1}$. The binomial coefficient in $P_{K-1}$ can be bounded using the fact that for any $N > K > 0$, we have [29]

$$\frac{1}{N+1}\, e^{N H\left(\frac{K}{N}\right)} \le \binom{N}{K} \le e^{N H\left(\frac{K}{N}\right)} \tag{11}$$

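where the entropy, $H\left(\frac{K}{N}\right)$, is computed in nats. Thus, $P^{MDS}_{E,sub}$ is bounded as

$$\frac{\pi (1 - \alpha) N e^{-N u(\alpha)}}{(1 - \pi)(N + 1)(\alpha N + 1)} \le P^{MDS}_{E,sub} \le \frac{\pi (1 - \alpha)^2 N^2 e^{-N u(\alpha)}}{(1 - \pi)(\alpha N + 1)} \tag{12}$$

where $u(\alpha)$ is defined as

$$u(\alpha) = \begin{cases} 0 & \text{for } \alpha \le \pi \\ \alpha \log \dfrac{\alpha (1 - \pi)}{\pi (1 - \alpha)} - \log \dfrac{1 - \pi}{1 - \alpha} & \text{for } \pi < \alpha \le 1 \end{cases} \tag{13}$$

with the log functions computed in the Neperian base.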
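The bounds in (12) can be compared against the exact binomial sum in (9) for concrete parameters. The sketch below is only a numerical sanity check under the assumption $\alpha > \pi$; the function names are illustrative. The exponent is evaluated in the algebraically equivalent form $\alpha \log\frac{\alpha}{\pi} + (1-\alpha)\log\frac{1-\alpha}{1-\pi}$, which avoids a division by zero at $\alpha = 1$.

```python
from math import comb, exp, log

def u(alpha, pi):
    """MDS exponent of (13) in nats; zero for alpha <= pi."""
    if alpha <= pi:
        return 0.0
    tail = (1 - alpha) * log((1 - alpha) / (1 - pi)) if alpha < 1 else 0.0
    return alpha * log(alpha / pi) + tail

def p_sub_exact(n, k, pi):
    """Exact P_sub from (9): fewer than k symbols received correctly."""
    return sum(comb(n, i) * (1 - pi) ** i * pi ** (n - i) for i in range(k))

def p_sub_bounds(n, k, pi):
    """Lower and upper bounds of (12); meaningful for alpha > pi."""
    alpha = (n - k) / n
    decay = exp(-n * u(alpha, pi))
    common = pi * (1 - alpha) * n / ((1 - pi) * (alpha * n + 1))
    lower = common * decay / (n + 1)
    upper = common * (1 - alpha) * n * decay
    return lower, upper
```

With `n = 100`, `k = 70`, and `pi = 0.1`, the exact value from `p_sub_exact` falls between the two values returned by `p_sub_bounds`, and both bounds share the same exponential factor.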

Using equation (8), the MDS coding error exponent, $u(\cdot)$, can be expressed in terms of $R$ instead of $\alpha$. In (8), $K$ should be an integer, and we should have $q + 1 \ge N$ for a feasible MDS code. Thus, the finest resolution of rates achievable by a single MDS codebook would be $R = \frac{i}{q+1} \log q$ for $i = 1, 2, \dots, q$. Of course, it is also possible to achieve the rates in the intervals $\frac{i}{q+1} \log q < R < \frac{i+1}{q+1} \log q$ by time sharing between two MDS codebooks of sizes $[q + 1, i]$ and $[q + 1, i + 1]$. However, in such cases, the smaller error exponent belonging to the codebook of the size $[q + 1, i + 1]$ dominates. Therefore, $u(R)$ will have a stepwise shape of the form

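$$u(R) = \begin{cases} 0 & \text{for } 1 - \pi < \tilde{r} \le 1 \\ -\tilde{r} \log \dfrac{(1 - \pi)(1 - \tilde{r})}{\tilde{r} \pi} - \log \dfrac{\pi}{1 - \tilde{r}} & \text{for } 0 < \tilde{r} \le 1 - \pi \end{cases} \tag{14}$$

where $\tilde{r}$ is defined as

$$\tilde{r} = \frac{1}{q + 1} \left\lceil \frac{(q + 1) R}{\log q} \right\rceil. \tag{15}$$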
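The stepwise exponent can be sketched in a few lines. The code below assumes that the rate is first rounded up to the next multiple of $\frac{\log q}{q+1}$, reflecting the statement that the codebook with the smaller exponent dominates under time sharing, and then inserted into the closed-form expression of the exponent. The function name and the small rounding guard are illustrative choices, not part of the paper.

```python
from math import ceil, log

def mds_exponent(rate, q, pi):
    """Stepwise MDS exponent u(R); `rate` is in nats per symbol.

    The rate is quantized upward to a multiple of log(q) / (q + 1).
    """
    if rate <= 0:
        raise ValueError("rate must be positive")
    # The 1e-9 guard keeps floating-point noise from pushing a rate that
    # sits exactly on a step boundary into the next step.
    steps = ceil((q + 1) * rate / log(q) - 1e-9)
    r_tilde = steps / (q + 1)
    if r_tilde > 1 - pi:
        return 0.0
    return (-r_tilde * log((1 - pi) * (1 - r_tilde) / (r_tilde * pi))
            - log(pi / (1 - r_tilde)))
```

All rates inside one quantization interval map to the same value, which produces the staircase; for large `q` the steps become narrow and the staircase approaches its envelope.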



B. Random Coding Error Exponent of a Memoryless Erasure Channel 

It is interesting to compare the error exponent in (14) with the random coding error exponent as described in [6]. This exponent, $E_r(R)$, can be written as

$$E_r(R) = \max_{0 \le \rho \le 1} \left\{ -\rho R + \max_{Q} E_0(\rho, Q) \right\} \tag{16}$$



where $Q$ is the input distribution, and $E_0(\rho, Q)$ equals

$$E_0(\rho, Q) = -\log \left( \sum_{j=0}^{q} \left[ \sum_{k=0}^{q-1} Q(k)\, P(j|k)^{\frac{1}{1+\rho}} \right]^{1+\rho} \right). \tag{17}$$



Due to the symmetry of the channel transition probabilities, the uniform distribution maximizes (16) over all possible input distributions. Therefore, $E_0(\rho, Q)$ can be simplified as

$$E_0(\rho, Q) = -\log \left( \frac{1 - \pi}{q^{\rho}} + \pi \right). \tag{18}$$



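Solving the maximization (16) gives us $E_r(R)$ as

$$E_r(R) = \begin{cases} -\log \dfrac{1 - \pi + \pi q}{q} - r \log q & \text{for } 0 \le r \le \dfrac{R_c}{\log q} \\ -r \log \dfrac{(1 - \pi)(1 - r)}{r \pi} - \log \dfrac{\pi}{1 - r} & \text{for } \dfrac{R_c}{\log q} \le r \le 1 - \pi \end{cases} \tag{19}$$

where $r = \frac{R}{\log q}$ and $R_c = \frac{1 - \pi}{1 - \pi + \pi q} \log q$ are the normalized and the critical rates, respectively.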
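The closed form in (19) can be cross-checked against a direct numerical maximization of (16) with the simplified $E_0$ of (18). The sketch below uses a plain grid search over $\rho \in [0, 1]$; the function names and the grid size are arbitrary illustrative choices.

```python
from math import log

def e0(rho, q, pi):
    """E_0 of (18) for the uniform input distribution (nats)."""
    return -log((1 - pi) / q ** rho + pi)

def random_coding_exponent(rate, q, pi):
    """Closed-form random coding exponent of (19); `rate` in nats."""
    r = rate / log(q)
    r_crit = (1 - pi) / (1 - pi + pi * q)
    if r >= 1 - pi:
        return 0.0                      # at or above capacity
    if r <= r_crit:
        return -log((1 - pi + pi * q) / q) - rate
    return -r * log((1 - pi) * (1 - r) / (r * pi)) - log(pi / (1 - r))

def random_coding_exponent_numeric(rate, q, pi, steps=20000):
    """Grid search over rho in [0, 1] of -rho * R + E_0(rho)."""
    return max(-rho * rate + e0(rho, q, pi)
               for rho in (i / steps for i in range(steps + 1)))
```

For `q = 128` and `pi = 0.015`, the two functions agree to within the grid resolution at rates both below and above the critical rate, and the two branches of the closed form meet at $r = R_c / \log q$.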
Fig. 2. Error exponents of random coding ($E_r(R)$) and MDS coding ($u(R)$) for a memoryless erasure channel with $\pi = 0.015$, and (a): $q = 128$, (b): $q = 1024$.



Comparing (14) and (19), we observe that the MDS codes and the random codes perform exponentially the same for rates between the critical rate and the capacity. However, for the region below the critical rate, where the error exponent of the random code decays linearly with $R$, MDS codes achieve a larger error exponent. It is worth noting that this interval is negligible for large alphabet sizes. Moreover, the stepwise graph of $u(R)$ meets its envelope as the steps are very small for large values of $q$.

Figure 2 depicts the error exponents of random codes and MDS codes for the alphabet sizes of $q = 128$ and $q = 1024$ over an erasure channel with $\pi = 0.015$. As observed in Fig. 2(a), $u(R)$ can be approximated by its envelope very closely even for a relatively small alphabet size ($q = 128$). For a larger alphabet size (Fig. 2(b)), the graph of $u(R)$ almost coincides with its envelope, which equals $E_r(R)$ for the region above the critical rate. Moreover, as observed in Fig. 2(b), the region where MDS codes outperform random codes becomes very small even for moderate values of alphabet size ($q = 1024$).
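The size of the region below the critical rate can be quantified directly from the expressions for $R_c$ and $C$. The one-line sketch below (with an illustrative function name) computes the ratio $R_c / C = \frac{1}{1 - \pi + \pi q}$, i.e., the fraction of the rate interval $[0, C]$ on which the two exponents differ.

```python
def critical_rate_fraction(q, pi):
    """R_c / C = 1 / (1 - pi + pi * q), using R_c from (19) and
    C = (1 - pi) * log(q)."""
    return 1.0 / (1.0 - pi + pi * q)
```

For $\pi = 0.015$ this ratio is roughly 0.34 for $q = 128$ and roughly 0.06 for $q = 1024$, and it keeps shrinking as the product $\pi q$ grows.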



C. Linear Random Coding Error Exponent of a Memoryless Erasure Channel 

Maximum likelihood decoding of random codes generally has exponential complexity in terms of the block length ($N$). Linear random codes, on the other hand, have the advantage of polynomial decoding complexity (assuming maximum likelihood decoding) over any arbitrary erasure channel [30]. In a linear codebook of size $[N, K]$, any codeword, $\mathbf{c}$, can be written as $\mathbf{c} = \mathbf{b} G$, where $\mathbf{b}$ is a row vector of length $K$ containing the information symbols, and $G$ is the generator matrix of size $K \times N$. In the case of a linear random codebook, every element in $G$ is generated independently according to a distribution $Q$ [10], [11]. For a memoryless erasure channel, due to the symmetry of the channel transition probabilities, the uniform distribution is applied to generate $G$.
Here, we describe a suboptimal decoder with polynomial complexity for decoding of linear block codes over erasure channels. This decoder is a slightly modified version of the optimum (maximum likelihood) decoder in [30]. In case fewer than $K$ symbols are received correctly, a decoding error is declared. When $K$ or more correct symbols are received, the decoder determines the information vector $\mathbf{b}$ (and the transmitted codeword $\mathbf{c}$) by constructing a new matrix called the reduced generator matrix, $\tilde{G}$. $\tilde{G}$ consists of the columns in $G$ whose corresponding symbols are received correctly. Thus, if the erasure identifier vector $\mathbf{e}$ has the weight of $w(\mathbf{e}) = m \le N - K$, $\tilde{G}$ would have the size of $K \times (N - m)$. Then, the decoder computes the row or column rank of $\tilde{G}$. If this rank is less than $K$, a decoding error is reported. In case the rank is equal to $K$, the information symbol vector can be decoded uniquely by solving $\mathbf{b} \tilde{G} = \tilde{\mathbf{y}}$. In this case, $\tilde{\mathbf{y}}$ is the reduced received vector consisting of the correctly received symbols only.
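The decoder described above can be sketched with Gauss-Jordan elimination. The code below is a minimal illustration that assumes a prime field size `p` (so that inverses are available through modular arithmetic) and represents erasures by `None`; the function name `decode_linear` and these conventions are hypothetical and are not taken from [30].

```python
def decode_linear(G, received, p):
    """Suboptimal erasure decoder for a linear code over GF(p), p prime.

    G: K x N generator matrix as a list of rows.
    received: list of length N, with None marking an erased symbol.
    Returns the information vector b, or None if a decoding error is declared.
    """
    k = len(G)
    cols = [j for j, y in enumerate(received) if y is not None]
    if len(cols) < k:
        return None                      # fewer than K symbols received
    # b * G_tilde = y_tilde is solved as G_tilde^T b^T = y_tilde^T:
    # one augmented row [column of G | received symbol] per unerased position.
    rows = [[G[i][j] % p for i in range(k)] + [received[j] % p] for j in cols]
    pivot_row = 0
    for col in range(k):
        pivot = next((r for r in range(pivot_row, len(rows)) if rows[r][col]),
                     None)
        if pivot is None:
            return None                  # rank of G_tilde is below K
        rows[pivot_row], rows[pivot] = rows[pivot], rows[pivot_row]
        inv = pow(rows[pivot_row][col], -1, p)
        rows[pivot_row] = [v * inv % p for v in rows[pivot_row]]
        for r in range(len(rows)):
            if r != pivot_row and rows[r][col]:
                factor = rows[r][col]
                rows[r] = [(a - factor * b) % p
                           for a, b in zip(rows[r], rows[pivot_row])]
        pivot_row += 1
    return [rows[i][k] for i in range(k)]
```

With `G = [[1, 0, 1, 1], [0, 1, 1, 2]]` over GF(7) and the codeword `[3, 5, 1, 6]` of the information vector `[3, 5]`, the call `decode_linear(G, [None, None, 1, 6], 7)` recovers `[3, 5]` from the two surviving symbols, while erasing three symbols yields `None`.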

Using the described suboptimal decoder, the probability of error is the probability that the rank of $\tilde{G}$ is less than $K$. Thus, the probability of error conditioned on an erasure vector of weight $w(\mathbf{e}) = m$ can be written as [31]

$$P\{\text{error} \mid w(\mathbf{e}) = m\} = 1 - \prod_{i=N-m-K+1}^{N-m} \left( 1 - \frac{1}{q^i} \right). \tag{20}$$
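The product in (20) is the probability that a uniformly random $K \times (N - m)$ matrix over $\mathbb{F}_q$ has rank $K$. For very small parameters this can be confirmed by exhaustive enumeration. The sketch below does so over $\mathbb{F}_2$ only, representing each row as a bit mask; both function names are illustrative.

```python
from itertools import product

def full_rank_probability(k, n, q):
    """Product term of (20): probability that a uniformly random k x n
    matrix over GF(q) has rank k (n >= k)."""
    prob = 1.0
    for i in range(n - k + 1, n + 1):
        prob *= 1 - q ** (-i)
    return prob

def full_rank_fraction_gf2(k, n):
    """Brute force over GF(2): fraction of k x n binary matrices whose
    rows are linearly independent. Each row is an n-bit integer."""
    count = total = 0
    for rows in product(range(2 ** n), repeat=k):
        total += 1
        span = {0}                       # span of the rows seen so far
        independent = True
        for row in rows:
            if row in span:
                independent = False
                break
            span |= {s ^ row for s in span}
        count += independent
    return count / total
```

For `k = 2` and `n = 3` both functions give the same value, and one minus that value is the conditional error probability of (20) for $q = 2$, $K = 2$, and $N - m = 3$.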

We bound the above probability as

$$P\{\text{error} \mid w(\mathbf{e}) = m\} \le 1 - \left( 1 - \frac{1}{q^{N-m-K+1}} \right)^K \overset{(a)}{\le} \frac{K}{q^{N-m-K+1}} \tag{21}$$

where (a) follows from Bernoulli's inequality [32] and the assumption that $w(\mathbf{e}) = m \le N - K$. The total probability of error is written as

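$$
\begin{aligned}
P^{lin}_{E,sub} &= \sum_{i=0}^{K-1} P_i + \sum_{i=K}^{N} P_i\, P\{\text{error} \mid w(\mathbf{e}) = N - i\} \\
&\overset{(a)}{\le} \sum_{i=0}^{K-1} P_i + K \sum_{i=K}^{N} \frac{P_i}{q^{i-K+1}} \\
&= \sum_{i=0}^{K-2} P_i + Q_{K-1} + K \sum_{i=K}^{N} Q_i
\end{aligned} \tag{22}
$$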

where $P_i$ denotes the probability that $i$ symbols are received correctly as defined in subsection IV-A, and $Q_i = \frac{P_i}{q^{i-K+1}}$. (a) follows from (21).

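We define $i_0$ as $i_0 = \frac{(N+1)(1-\pi)}{1 - \pi + \pi q}$. Of course, $i_0$ is not necessarily an integer. For the case where $i_0 < K$, similar to equation (10), we can write

$$\frac{Q_i}{Q_{i-1}} = \frac{(N - i + 1)(1 - \pi)}{i q \pi} < 1 \quad \text{for } i = K, \dots, N. \tag{23}$$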
Thus, the $Q_i$'s are decreasing, and we have

$$
\begin{aligned}
P^{lin}_{E,sub} &\overset{(a)}{\le} \sum_{i=0}^{K-1} P_i + K (N - K + 1)\, Q_{K-1} \\
&\overset{(b)}{\le} (N - K + 2)\, K\, P_{K-1} \\
&\overset{(c)}{\le} \frac{\pi K^2 (N - K + 2)}{(1 - \pi)(N - K + 1)}\, e^{-N E_r(R)} \\
&= \frac{\pi N^2 r^2 (N - Nr + 2)}{(1 - \pi)(N - Nr + 1)}\, e^{-N E_r(R)}
\end{aligned} \tag{24}
$$

where (a) follows from (23) and (22), (b) results from (10), and (c) is based on (11) and (8). The condition $i_0 < K$ can also be rewritten as $\left(1 + \frac{1}{N}\right) \frac{R_c}{\log q} < r$, where $r = \frac{R}{\log q}$ as in (19).

For the case where $K \le i_0$, according to equation (23), the series $\{Q_i\}_{i=K-1}^{N}$ has its maximum at $i^* = \lfloor i_0 \rfloor \ge K$. Thus, we have

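$$
\begin{aligned}
P^{lin}_{E,sub} &\overset{(a)}{\le} \sum_{i=0}^{K-1} P_i + K (N - K + 1)\, Q_{i^*} \\
&\overset{(b)}{\le} (N - K + 2)\, K\, Q_{i^*} \\
&\overset{(c)}{\le} (N - K + 2)\, K \exp\left\{ -N \left[ \frac{i^*}{N} \log \frac{i^* \pi q}{(1 - \pi)(N - i^*)} - \log \frac{\pi}{1 - \frac{i^*}{N}} - \frac{K - 1}{N} \log q \right] \right\} \\
&\le (N - K + 2)\, K \exp\left\{ -N \left[ \frac{i_0 - 1}{N} \log \frac{i_0 - 1}{i_0} - \log \frac{\pi}{1 - \frac{i_0}{N}} - \frac{K - 1}{N} \log q \right] \right\} \\
&\overset{(d)}{=} (N - rN + 2)\, N r\, e^{-N f(R, N)}
\end{aligned} \tag{25}
$$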
where $\exp(x) = e^x$, and $f(R, N)$ is defined as below

$$f(R, N) = \frac{N(1 - \pi) - \pi q}{N(1 - \pi + \pi q)} \log \frac{N(1 - \pi) - \pi q}{(N + 1)(1 - \pi)} - \log \frac{\pi N (1 - \pi + \pi q)}{N \pi q - 1 + \pi} - R + \frac{\log q}{N}. \tag{26}$$



In (25), (a) follows from (23) and (22), and (b) results from (10). (c) is based on (11), and can be derived similar to (12). (d) follows from (8). Combining (24) and (25) results in

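$$
P^{lin}_{E,sub} \le \begin{cases} (N - rN + 2)\, N r \exp\left\{ -N \left( -\log \dfrac{1 - \pi + \pi q}{q} - R + o(1) \right) \right\} & \text{for } R \le R_c \left( 1 + \dfrac{1}{N} \right) \\[2ex] \dfrac{\pi r^2 N^2 (N - Nr + 2)}{(1 - \pi)(N - Nr + 1)} \exp\left( -N E_r(R) \right) & \text{for } R > R_c \left( 1 + \dfrac{1}{N} \right) \end{cases} \tag{27}
$$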
D. Exponential Optimality of Random Coding and Linear Random Coding

Using the sphere packing bound, it is shown that random coding is exponentially optimal for the rates above the critical rate over channels with relatively small alphabet sizes ($q \ll N$) [8], [9]. In other words, we know that

$$P^{rand}_{E,ML} \doteq e^{-N E_r(R)} \tag{28}$$

where the notation $\doteq$ means $\lim_{N \to \infty} -\frac{\log P^{rand}_{E,ML}}{N} = E_r(R)$. However, the sphere packing bound is not tight for the channels whose alphabet size, $q$, is comparable to the block length. Here, based on Proposition I and the results of section IV, we prove the exponential optimality of random coding and linear random coding over the erasure channels for all block sizes (both $N > q + 1$ and $N \le q + 1$).

The average decoding error probability for an ensemble of random codebooks with the maximum-likelihood decoding can be upper bounded as

$$P^{rand}_{E,ML} \overset{(a)}{\le} e^{-N E_r(R)} \overset{(b)}{=} e^{-N u(R)} \tag{29}$$

where (a) follows from [6], and (b) is valid only for rates above the critical rate according to (14) and (19). The similar upper-bound for $P^{lin}_{E,sub}$ is given in (24).
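We can also lower bound $P^{rand}_{E,ML}$ and $P^{lin}_{E,sub}$ as

$$
\begin{aligned}
P^{rand}_{E,ML} &\overset{(a)}{\ge} P^{MDS}_{E,ML} \\
&\overset{(b)}{\ge} \left( 1 - \frac{1}{q} \right) P^{MDS}_{E,sub} \\
&\overset{(c)}{\ge} \left( 1 - \frac{1}{q} \right) \frac{\pi r N e^{-N u(R)}}{(1 - \pi)(N + 1)((1 - r) N + 1)}
\end{aligned} \tag{30}
$$

where (a) follows from Proposition I and (2), (b) from inequality (7), and (c) from inequality (12). The inequality in (30) remains valid if $P^{rand}_{E,ML}$ is replaced by $P^{lin}_{E,sub}$.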

Combining (29) and (30) guarantees that both the upper-bound and the lower-bound on $P^{rand}_{E,ML}$ are exponentially tight, and the decaying exponent of $P^{rand}_{E,ML}$ versus $N$ is indeed $u(R)$. Combining (24) and (30) proves the same result about the exponent of $P^{lin}_{E,sub}$ versus $N$. Moreover, we can write

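$$
\begin{aligned}
P^{MDS}_{E,ML} &\overset{(a)}{\le} P^{rand}_{E,ML} \overset{(b)}{\le} \frac{(1 - \pi)(N + 1)(N - rN + 1)}{\left( 1 - \frac{1}{q} \right) \pi r N}\, P^{MDS}_{E,ML} \\
P^{MDS}_{E,ML} &\overset{(a)}{\le} P^{lin}_{E,sub} \overset{(c)}{\le} \frac{N r (N + 1)(N - rN + 2)}{1 - \frac{1}{q}}\, P^{MDS}_{E,ML}
\end{aligned} \tag{31}
$$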

where (a) follows from Proposition I and (2), and (b) results from inequalities (29) and (30). (c) is based on (7), (12), and (24). Since the coefficients of $P^{MDS}_{E,ML}$ in (31) do not include any exponential terms, it can be concluded that for rates above the critical rate, both random codes and linear random codes perform exponentially the same as MDS codes, which are already shown to be optimum.
V. Conclusion

The performance of random codes, linear random codes, and MDS codes over an erasure channel with a fixed, but large alphabet size is analyzed. We proved that MDS codes minimize the probability of decoding error (using maximum-likelihood decoding) over any erasure channel (with or without memory). Then, the decoding error probabilities of MDS codes, random codes, and linear random codes are bounded by exponential terms, and the corresponding exponents are compared. It is observed that the error exponents are identical over a wide range of rates. Knowing that MDS codes are optimum, it is concluded that both random coding and linear random coding are exponentially optimal over a memoryless erasure channel for all block sizes (whether $N > q + 1$ or $N \le q + 1$).